Research on Constructing “Parallel Contrast Corpus of Grammatical Errors”
Similar resources
Constructing Parallel Corpus from Movie Subtitles
This paper describes a methodology for constructing aligned German-Chinese corpora from movie subtitles. The corpora will be used to train a specialized machine translation system intended to translate subtitles automatically between German and Chinese. Since the common length-based alignment algorithm performs poorly on short spoken sentences, especially on those from different langua...
On Constructing a Theory of Grammatical Change
The last few decades have seen the growth of a community of linguists who, though diverse in their beliefs and assumptions about other aspects of linguistics, nevertheless share a commitment to the construction as the basic unit of linguistic analysis. In construction grammar (henceforth CxG), as this family of approaches is known, ‘construction’ is understood – beyond its pretheoretical sense ...
Comparing k-means clusters on parallel Persian-English corpus
This paper compares clusters of aligned Persian and English texts obtained with the k-means method. Text clustering has many applications in various fields of natural language processing. So far, much research has been done on clustering English documents. This raises the question of whether those results extend to other languages. Since the goal of document clustering is grouping of docum...
Constructing of a Large-Scale Chinese-English Parallel Corpus
This paper describes the construction of a large-scale (more than 500,000 sentence pairs) Chinese-English parallel corpus. The current status of Chinese corpora is reviewed, with emphasis on parallel corpora. The XML coding principles for the Chinese–English parallel corpus are discussed. The sentence alignment algorithm used in this project is described, together with a computer-aided checking process. F...
Constructing a Chinese–Japanese Parallel Corpus from Wikipedia
Graduate School of Informatics, Kyoto University, Yoshida-honmachi, Sakyo-ku, Kyoto, 606-8501, Japan. E-mail: {chu, nakazawa}@nlp.ist.i.kyoto-u.ac.jp, [email protected]. Abstract: Parallel corpora are crucial for statistical machine translation (SMT). However, they are quite scarce for most language pairs, such as Chinese–Japanese. As comparable corpora are far more available, many studies have ...
Journal
Journal title: Journal of Language Teaching and Research
Year: 2020
ISSN: 1798-4769
DOI: 10.17507/jltr.1105.10